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ABSTRACT 

We formulate the concept of non-linear and stochastic galaxy biasing in the frame- 
work of halo occupation statistics. Using two-point statistics in projection, we de- 
fine the galaxy bias function, 6g(rp), and the galaxy-dark matter cross-correlation 
function, 7^gm(''p), where Vp is the projected distance. We use the analytical 
halo model to predict how the scale dependence of 6g and T^gm, over the range 
0.1h~^ Mpc < rp < 30h~^ Mpc, depends on the non-linearity and stochasticity in halo 
occupation models. In particular we quantify the effect due to the presence of central 
galaxies, the assumption for the radial distribution of satellite galaxies, the richness 
of the halo, and the Poisson character of the probability to have a certain number of 
satellite galaxies in a halo of a certain mass. Overall, brighter galaxies reveal a stronger 
scale dependence, and out to a larger radius. In real-space, we find that galaxy bias 
becomes scale independent, with TZg = 1, for radii r > 1 — 5h~^ Mpc, depending on 
luminosity. However, galaxy bias is scale-dependent out to much larger radii when 
one uses the projected quantities defined in this paper. These projected bias functions 
have the advantage that they are more easily accessible observationally and that their 
scale dependence carries a wealth of information regarding the properties of galaxy 
biasing. To observationally constrain the parameters of the halo occupation statistics 
and to unveil the origin of galaxy biasing we propose the use of the bias function 
rgiji(rp) = bg{rp) /Tlgni{rp) . This function is obtained via a combination of weak grav- 
itational lensing and galaxy clustering, and it can be measured using existing and 
forthcoming imaging and spectroscopic galaxy surveys. 
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1 INTRODUCTION 

According to the current paradigm of structure formation in 
the Universe, galaxies form and reside within dark matter 
haloes which emerge from the (non-linear) growth of pri- 
mordial density fluctuations. In this scenario, one expects 
that the spatial distribution of galaxies traces, to first or- 
der, that of the underlying dark matter. However, due to 
the complexity of galaxy formation and evolution, galaxies 
are also expected to be somewhat biased tracers. In gen- 
eral, the relationship between the galaxy and dark matter 
distribution is loosely referred to as galaxy biasing (see e.g. 
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iDavis et al.lll985l : iBardeen et alll 19861 : iDekel fc Ree3ll987l '). 
At any cosmic epoch, this relation is the end product of pro- 
cesses such as non-linear gravitational collapse, gas cooling, 
star formation, and possible feedback mechanisms. There- 
fore, understanding the features of galaxy biasing is crucial 
for a thorough comprehension of galaxy formation and evo- 
lution as well as for the interpretation of those studies which 
use galaxies as tracers of the underlying dark matter distri- 
bution in an attempt to constrain cosmological parameters. 
The simplest way to model galaxy biasing is by assum- 
ing a linear and deterministic relation between the matter 
and the galaxy density fields. Galaxy formation, though, is 
a complicated process and the validity of such a simplistic 
assumption is highly questionable. As a consequence, nu- 
merous authors have presented various arguments for con- 
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sidering modifications fro m a simple linear and determin- 



istic biasing schem e (e.g., iKaiserl 11984: iDavis et al. 



Bardocn et al. 1986; Dckcl & Silk 1986: 



Dekel & Rees 



198£; 



1987 



Braun ot al. 1988; Babul fc White 1991; Lahav fc Saslaw 



19921: IScherrer fc Weinberelll998l: iTegmark fc Peebles! 1 19981 : 



Dekel fc Lahavl Il999l : iTaruva fcSodal I1999I I. In addition, 
cosmological simulations and semi-analytical models of 
galaxy formation strongly suggest that galaxy bias in- 



deed takes on a non-trivial form (e.g., Cen fc Ostriken 



19921: iKauffmaiin et al.l Il997l : iTegmark fc BromlevI Il999l : 



SomerviUe et all I2OOII ') 



From the observational side, nu- 
merous attempts have been made to test whether 
galaxy bias is linear and deterministic. This includes 
studies that compare the clusterin g properties of dif- 



fereii t sample s of galaxies (e-K -; Tegmark fc Bromley! 



1999!: lBla nton"2000'; 'Conwav ot al.''2005'; 'Wild ct al."2005' 



Wang et a l. 2007; Swanson ct al. 2008; Zchavi ct al. 2011) 



studie s that measure highe r -order correlation fu n ctions 
(e.g., Frieman fc Gaztanagal 1 1999! : ISzapudi et al.! I2OO2! : 
I Verde et al.! 120021 ), studies that compare observed fluctua- 
tions in the galaxy distribution to matte r fluctuations pre- 
dicted in numeric al simulations (e.g., M arinoni et al.! 12005! : 
iKovac et al.ll201l! ), and studies that combine galaxy cluster- 
ing measuremen ts with grav itational lensing measurements 



mg measuremen ts witti grav itational lensing measurements 
IJHoekstraet al.llio Ol. 2002' : |Pen et al!l2003l : Isheldon et all 



I2OO4! : ISimon et al.i 120071 : IJuUo et al.l I2OI2I ). We refer the 
reader to 311 for a detailed discussion of the pros and cons 
of these different methods. Here we simply mention that the 
majority of these observational studies have confirmed that 
galaxy bias is neither linear nor deterministic. 

Although this observational result is in qualitative 
agreement with theoretical predictions, we still lack a direct 
link between model predictions and actual measurements. 
This is mainly a consequence of the fact that the standard 
formalism used to define (the non-linearity and stochasticity 
of) galaxy bias is difficult to interpret in the framework of 
galaxy formation models. In this paper we introduce a new 
methodology that allows for a more intuitive interpretation 
of galaxy bias that is more directly linked to various concepts 
of galaxy formation theory. In particular, we reformulate the 
parameterization introduced by Dekel fc Lahav (1999; here- 
after DL99) to describe the non-linearity and stochasticity 
of the relation between galaxies and matter in the langauge 
of halo occupation statistics. Since galaxies are believed to 
reside in dark matter haloes, halo occupation distributions 
are the natural way to parameterize the galaxy-dark matter 
connection, and thus the concept of galaxy bias. In addition, 
combined with the halo model, which is an analytical for- 
malis m to des cribe the non-linear clustering of dark matter 
(e.g.. lMa fcFrv 2000 ; Seliak 2000 ), halo occupation models 
can be used to comp ute the n-point statistic s of the galaxy 
distribution (see e.g., ICoorav fc Sheth!l200a ), which can be 
compared with observations. Hence, while the formalism in 
DL99 was not designed to identify the sources of stochas- 
ticity and non-linearity in the relation between galaxies and 
dark matter, its reformulation in terms of halo occupation 
statistics holds the potential to unveil the hidden factors 
from which deviations from the simple linear and determin- 
istic galaxy biasing arisqj. In order to demonstrate the po- 



^ In DL99, sources of deviations from the linear and determinis- 



tential power of this methodology, we make extensive use 
of two-point statistics (galaxy-galaxy, galaxy matter, and 
matter-matter correlation functions) to investigate how non- 
linearity and stochasticity in the halo occupation statistics, 
which have intuitive connections with galaxy formation, im- 
pact the observable properties of galaxy bias. 

The study presented in this paper exploits the fact that 
with the advent of large and homogeneous galaxy surveys, it 
has become possible to conduct statistical studies of galaxy 
properties in connection with the assumed underlying dark 
matter distribution. For instance, one has accurate measure- 
ments of the two-point correlation function of galaxies as a 
function of their pr operties, such as l uminosity, morphol- 
ogy and colour (e.g . Guzzo et al. 200C ;_ Norberg et al.!l200ll . 



I2OO2! ; IZehavi et aiTl2005l : IWang et al.!T2007l ') which are rou- 
tinely compared with the clustering properties of simulated 
dark matter haloes of different masses. Alongside with clus- 
tering measurements, gravitational lensing represents a di- 
rect diagnostic of the galaxy-dark matter connection. For 
instance, galaxy-galaxy lensing and cosmic shear (which are 
related to the galaxy-matter and the matter-matter correla- 
tion functions) have rapidly evolved into techniques capable 
of probing the properties of the dark matter distribution 
using galaxies as tracers. There is growing scientific inter- 
est in combining the complementary information obtained 
from weak lensing and galaxy clustering to further constrain 
the properties of the galaxy-dark matter connection and 
in turn the und e rlying cosmological model. For instance, 
iGaztafiaga et al.l (|2011f ) considered three different types of 
probes in their analysis: 1) angular clustering from galaxy- 
galaxy autocorrelation in narrow redshift bins; 2) weak lens- 
ing from shear-shear, galaxy-shear and magnification (i.e. 
galaxy-matter cross-correlation); and 3) redshift space dis- 
tortions from the ratio of transverse to radial modes. The 
combination of such measurements provides a significant 
improvement in the forecast for the evolution of the dark 
energy equation of state and the cosmic growth evolution. 
This improvement comes from measurements of galaxy bias, 
which affects both redshift-space distortion and w eak lensing 
cross - correlations, but in d ifferent ways (see also IYoo et al.! 
I2OO6! : ICacciato et al.!l2009l ). A sinfilar approach can also be 
used to test the validity of general relativity (GR) on cosmo- 
logical scales, provided that the scale-dependence of galaxy 
bias due to stochasticity and non-linearity is sufficiently 
small (e.g.,[Zhang ct al. 2007; Reyes et al. 20i0). 

In light of forthcoming sur veys such as Pan-ST ARRS, 
KIDS, DES, LSST, and EuclicQ (|Laureiis et al.ll201ll ) which 
will produce deep imaging over large fractions of the sky and 
thereby yield statistical measurements with unprecedented 
quality, it is mandatory to improve our understanding of 
galaxy bias which currently limits our ability to exploit the 
potential of such surveys. To this aim, the approach under- 



tic biasing scheme are repeatedly referred to as "hidden factors 
affecting galaxy formation" . 

^ Pan-STARRS:Panoramic Survey Telescope & Rapid Response 
System, |http://pan-starrs.ifa.hawaii.edu| 

KIDS: KIl o-Degree Survey, 

http://www .astro-wise.Org/projects/KIDS/l 
DES; Dark Energy Survey, https://www.darkenergysurvey.org; 
LSST: Large Synoptic Survey Telescope, http://www.lsst.org, 
Euclid: http://www.euclid-ec.orgl 



Galaxy Biasing in the Halo Model 3 



taken in this paper consists of introducing a method which 
predicts features due to galaxy biasing observable via a com- 
bination of galaxy clustering and galaxy lensing measure- 
ments. One spin-off of this paper is a characterisation of the 
length scale above which it is safe to assume that galaxy 
bias is scale-independent. This is important for a number 
of cosmologic al studies, su c h as t esting GR via the method 
advoc ated bv lZhang et al.l (|2007|) and used bv iReves et al.l 
(l2010D . 

This paper is organized as follows. In §2 we re- visit the 
classical concepts of galaxy biasing and we also formulate it 
in the context of the halo model. In §3 we present a realistic 
description of the halo occupation statistics and we describe 
how its assumptions affect the value of galaxy bias param- 
eters. In §4 we introduce the galaxy bias functions bg, 7?.gm 
and Fgni in terms of two-point statistics. In §5 we analyze 
different scenarios of the way galaxies may populate dark 
matter haloes highlighting how these scenarios translate in 
distinct features in the scale dependence of the galaxy bias 
functions. In §6 we comment on existing observational at- 
tempts to constrain galaxy bias. In §7 we discuss our results 
and draw conclusions. Throughout this paper, we assume a 
flat ACDM cosmology specified by the following cosmolog- 
ical parameters: (f2m, cr8,n, ft) = (0.24,0.74,0.95,0.73) sup- 
ported by the results of the third ye ar of the Wilkinson Mi- 
crowave Anisotropy Probe fWMAP. [Spergel et "aLll2007l ). 



where fig is the average number density of galaxies and pm 
is the dark matter background density. Both these fields are 
smoothed with a smoothing window which defines the term 
'local'. Galaxy bias is said to be linear and deterministic if 



^g(x) = 6g5m(x), 



(2) 



where 6g is referred to as the galaxy bias parameter. Clearly, 
such a biasing scheme is highly idealized. As emphasized in 
DL99, the assumption of linear deterministic biasing must 
brake down in deep voids if 6g > 1 simply because (5g > 
— 1. Furthermore, numerical simulations have shown that the 
bias of dark matter haloes is both non-linear and stochastic 
fe.g.. IMo fc Whitelll996l : ISomerville et al.ll200ll l. Hence, it 
is only natural that galaxies, which reside in dark matter 
haloes, also are biased in a non-linear and stochastic manner. 
Indeed, simulations of galaxy formation in a cosmological 
context suggest a biasing relation t hat is non-linear , scale - 
dependent and stochastic (see e.g. ISomerville et al.ll200ll ). 
Here scale-dependence refers to the fact that the biasing 
description depends on the smoothing scale used to define 
5g and 5rn- 

Based on these considerations, DL99 generalized the 
concept of galaxy bias by considering the local biasing rela- 
tion between galaxies and matter to be a random process, 
specified by the biasing conditional distribution, P{6g\5m), 
of having a galaxy overdensity 5g at a given 5rn- They de- 
fined the mean btasmg function, 6(5m), by the conditional 



2 GALAXY BIASING 



iDekel fc Lahavl (119991 ) introduced a general formalism to 
describe galaxy biasing. In particular, they introduced 
convenient parameters to describe the non-linearity and 
stochasticity of the relation between galaxies and matter. 
A downside of their formalism, however, is that it is based 
on the smoothed galaxy and matter density fields. This 
smoothing blurs the interpretation of the associated two- 
point statistics on small scales (smaller than the smoothing 
length). In addition, the various bias parameters introduced 
in DL99 do not have counterparts that are easily accessible 
from observations, making it difficult to constrain the 
amounts of non-linearity and/or stochasticty as defined 
by DL99. In recent years, a more convenient method for 
describing the bias and clustering properties of galaxies 
has e merged in t he fo rm of halo occ upation statistics 



(e.g. IJing et all Il998l: Se li ak 20 00: 



2001 



1 



Berlind fc We inberg! |2002|: 



Collister fc Lahav 2005|; Ivan den Bosch e t al. 



Peacock fc SmithI 
JYang et al.l _|2003l : 



aD_f2 
200^ 



In 

this section, after a short recap of the classical description 
of galaxy biasing, we reformulate the DL99 formalism in the 
terminology of halo occupation statistics. This formulation 
allows for a far more direct and intuitive interpretation of 
the concepts of non-linearity and stochasticity. 



2.1 Classical Description 

Let ng(x) and pm(x) indicate the /oca Wensity fields of galax- 
ies and matter at location x, respectively. The corresponding 
overdensity fields are defined as 



5g(x) 



.(x) 



and 



(5m (x) 



Pm(x) 



(1) 



fe(<5m)<5,n = (<5g|(5m) = / 5gP(5g|5m)d5, 



(3) 



The function b{Sm) allows for any possible non-linear biasing 
and fully characterizes it, reducing to the special case of 
linear biasing when b{5m) = 6g is a constant independent 
of Sm- The stochasticity of galaxy bias is captured by the 
random biasing field, e, which is defined by 



(S.lSr.) 



(4) 



and has a vanishing local conditional mean, i.e., (el^m) ~ 0. 
The variance of e at a given Sm defines the stochasticity 
function (e^|(5m), whose average over Sm specifies the overall 
(global) stochasticity of the galaxy field. 



is'} 



\5n.)P{Sn.)d5n 



(5) 



with P{Sm) the probability distribution function (PDF) of 
Sm- As mentioned in DL99, the quantity e serves as an an- 
alytical tool to account for stochasticity without identify- 
ing its sources. The formalism presented in the next section 
gives a natural framework within which the hidden sources 
of stochasticity can be unveiled. In general, stochasticity is 
expected to arise from: i) the relation between dark matter 
haloes and the underlying dark matter density field; and ii) 
the way galaxies populate dark matter haloes. The former is 
better addressed using cosmological N-body simulatio ns (see 
e.g.,iMo fc White 1996; Cat elan et al. 1998; Porcia ni et al.1 
1 19991 : ISheth fc Lemsonlll999l ) and is not the goal of this pa- 
per. Rather, we focus on the second source of stochasticity, 
which we address using the analytical halo model comple- 
mented with halo occupation statistics. 
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2.2 Non-linearity and Stochasticity of Halo 
Occupation Statistics 

We now reformulate the DL99 formalism in the language of 
halo occupation statistics. Rather than using the overdensi- 
ties (5g and S^n of the smoothed galaxy and matter density 
fields, we use two new variables: the number of galaxies in a 
dark matter halo, A'^, and the mass of that dark matter halo, 
M. In particular, the relation between galaxies and dark 
matter is now described by the halo occupation distribution 
P{N\M), rather than the conditional distribution P{5^\5m). 
The equivalent of the mean biasing function, 6(<5m), defined 
in Eq. ((3|, now becomes 






(6) 



where {N\M) is the mean of the halo occupation distribu- 
tion, i.e.. 



{N\M) = ^N P{N\M), 



(7) 



and the factor pm/fig is required on dimensional grounds. 
Following the same nomenclature as in DL99, linear, deter- 
ministic biasing now corresponds to 



TV 



■M, 



(8) 



which yields b{M) = 1. Since A*' is an integer, whereas the 
quantities on the rhs of Eq. ^ are real, it is immediately 
clear that in our new formulation linear, deterministic bi- 
asing is unphysical. Note, though, that this does not imply 
that b{M) = 1 is unphysical; after all, b{M) = 1 can be es- 
tablished by having {N\M} = (ng/pm) M, which is possible 
(in practice). In this case, however, there must be non-zero 
stochasticity. If, on the other hand, there is a deterministic 
relation between A'^ and M, then the bias cannot be linear 
(i.e., b{M) / 1). 

Following DL99, we characterize the function b{M) by 
the moments b and b defined by 



{b{M)M^} 



and 



{b^{M)M^} 



(9) 



Here (...) indicates an average over dark matter haloes, i.e.. 



(A) 



J An{M)dM 
Jn{M)dM 



(10) 



with n{M) the halo mass function, and a'lj = (M^). Galaxy 
bias is linear if b/b — 1. It is straightforward to see that this 
is only possible if b{M) is independent of halo mass. Hence, 
in our new formulation we have that linear bias corresponds 
to halo occupation statistics for which {N\M) oc A'l. 

Motivated by DL99, we define the random halo bias 



eN = N- {N\M) 



(11) 



for which the conditional mean vanishes, i.e., (ejv|M) = 0. 
The variance of ejv for halos of a given mass defines the halo 
stochasticity function 



tI{M) 



AM) 



(12) 



where, following DL99, the scaling by a\,j is introduced for 
convenience. By averaging over halos of all masses, we finally 
obtain the stochasticity parameter 



2 



Pm 



{el) 



(13) 



Galaxy bias is said to be deterministic if ah = 0. 

In addition to the bias parameters b and 6, which are 
mass moments of the mean biasing function b{M), one can 
also define other bias parameters. In particular, DL99 intro- 
duced the ratio of the variances, 6var = (Sg)/(5'^), which in 
our reformulation becomes 



Pm 



„2 



(M2) 



(14) 



Using Eq. (|ll|l and the fact that (ejv) ~ 0, one finds that 



J,2 £2,2 

Ovar = + ^h- 



(15) 



This equation, which is exactly the same as in DL99, 
makes it explicit that the bias parameter &var is sensitive to 
both non-linearity and stochasticity. Combining Eqs. (|14|l 
and (|15|) we have that 



Us 



b'+al] 



2 



It is useful to compare this to the covariance 



{NM) = !^bait, 

Tig 



(16) 



(17) 



which follows directly from Eqs. ((6]) and ^. Unlike the vari- 
ance {N'^), the covariance has no additional contribution 
from the biasing scatter a^, (see also DL99). 

Finally, we define the linear correlation coefficient 



(NM) 

(JN um 



(18) 



Using Eqs. (|16P - (|17|I . it is straightforward to see that we can 
write 



6 = 6v 



(19) 



Hence, the first moment of the mean bias function b[M) is 
simply the product of the ratio of variances, bvar, and the 
linear correlation coefficient, r. 

Using these parameters, we can now characterize a few 
special cases. As already mentioned above, the discrete na- 
ture of galaxies does not allow for a bias that is both linear 
and deterministic. However, the halo occupation statistics 
can in principle be such that the bias is linear and stochas- 
tic, in which case 



b = b^ b{M) = 1 

(Tb/0 



6,,, = (l + a^)i/2 
r=(l + a^)-/^ 



(20) 



so that fevar > 1, while r — 1/bvar < 1. In the case of non- 
linear, deterministic biasing these relations reduce to 



crb = 



r = 6/fo / 1 



(21) 
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2.3 The Importance of Central and Satellite 
Galaxies 

An important aspect of halo occupation statistics is the split 
of galaxies in two components: centrals and satellites. Cen- 
trals are galaxies that reside at the center of their dark mat- 
ter haloes, whereas satellites orbit around centrals. As we 
will see below, this distinction between centrals and satel- 
lites is the main cause for non-linearity and scale-dependence 
in galaxy bias. In order to gain some insight into how cen- 
trals and satellites seperately contribute to the stochasticity, 
we define their corresponding random halo biases 



iVc-(iVc!M) 



Ns-{Ns\M)^ 



(22) 



where A'c and A'a are the numbers of central and satellite 
galaxies, respectively. The halo stochasticity function for 
centrals is given by 



{si\M) 



J2 {Nc - {N4R4)f P{N,\M) 



{N4M) - {N^My 



(23) 



Hence, we have that, as expected, central galaxies only con- 
tribute stochasticity if {Nc\M} < 1. On the other hand, if 
(A'clM) is unity, then the occupation statistics of centrals 
are deterministic and {e'l\M) = 0. In the case of satellite 
galaxies we have that 



{e!\M) = XI (^= ^ {N,\M)f P{N,\M) 

JVs=0 

= (n!\M) - {N.lMf . 
Introducing the Poisson function 

{N,{N,-1)\M) 



P{M) 



{N,\My 



(24) 



(25) 



which is equal to unity if P(AfsjM) is given by a Poisson 
distribution, we can rewrite Eq. H24p as 



{e!\M} = [P{M) - 1] {N,\Mf + {N,\M} 



(26) 



This makes it explicit that {£s\M) — {Nb\M) if the occu- 
pation statistics of satellite galaxies obey Poisson statistics. 
Finally, the halo stochasticity function for all galaxies (cen- 
trals and satellites combined) can be written as 

{e^\PI) = {N'^\M) - {N\Mf 
= {£^,\M) + (e^lM) + 

2{{N,N,\ M) -{N,\ M) {N,\M)} . (27) 

where we have used that N = N^ + N^. Hence, as long as 
A^c and As are independent random variables, we have that 
{e^|Af) is simply the sum of the halo stochasticity functions 
for centrals and satellites. 



3 A REALISTIC EXAMPLE 

What are the typical values of b, b, ah, &var and r? In or- 
der to answer this question, we use a realistic model for 
the halo occupation statistics, as described by the C ondi- 
tional Luminosity Function (CLF: lYang et alJlJOO^ '). The 



CLF, <I>(L|M), specifies the number of galaxies of luminos- 
ity L that, on average, reside in a halo of mass M. F ollow - 
ing ICoorav fc Milosavlievicl (|2005D and lYang et al.l (I2008D 
we split the CLF in a central and satellite component; 

$(L|Af) = $c(L!A/) + ML\M) . (28) 

We use the CLF parameterization of lYang et al.l 
(2008), inferred from a large galaxy group catalogue 
(Yang et al. 2007) extracted from the SDSS Data Release 4 
( Adelman-McCarthy et al.ll2006l '). In particular, the CLF of 
central galaxies is modelled as a log-normal, 



ML\M) = 



-exp 



(logL- logic)' 



2ai 



/2^1n(10)crcL 
and the satellite term as a modified Schechter function 



,(29) 



'i's{L\M) 



!©"■»-© 



(30) 



Note that Lc, (Jc, (pl, a^ and L* are, in principle, all functions 
of halo mass M. As our fiducia l mode l, we adopt the specific 
CLF model of lCacciato et al.l (|2009l ). with both CTc and Os 
assumed to be independent of halo mass. This model is in 
excellent agreement with the observed abundances, cluster- 
ing, and galaxy-ga laxy lensin g properties of galaxies in the 
SDSS DR4 (see Cacciato et al. 2009) as well as with satellite 
kinematics (jMore et al.ll2009bl ). In other words, this partic- 
ular CLF provides a realistic and accurate description of 
the halo occupation statistics, at least for the cosmology 
adopted here. 

From the CLF it is straightforward to compute the halo 
occupation numbers. For example, the average number of 
galaxies with luminosities in the range Li < L < L2 is 
simply given by 



(AIM) 



$(LlM)dL. 



(31) 



The CLF, however, only specifies the first moment of the 
halo occupation distribution P{N\A'I). For central galaxies, 
(Ac |M) = (Ac I A'/), which simply follows from the fact that 
Ac is either zero or unity. For satellite galaxies, we use that 



{N^\M) = P{M){Ns\Mf + {Ns\M) 



(32) 



where I3{M) is the Poisson function [Eq. (|25|l ]. In what fol- 
lows we limit ourselves to cases in which /3(Af) is indepen- 
dent of halo mass, i.e., /3(Af) = /3, and we treat /3 as a 
free parameter. In our fiducial model we set /3 = 1, so that 
P(As|Af) is a Poisson distribution. 

Fig. [T] shows the parameters b (upper row), b/b (second 
row), r (third row), and log((T^) (bottom row) as functions of 
Li (expressed in "'^Afr — 51og/i, which is the r-band mag- 
nitude K+E corrected to z = 0.1). Throughout we have 
adopted luminosity bins of one magnitude width, so that 
L2 = 2.5L1; hence, the value of & at "'^Afr — 51og/i = — 20 in- 
dicates the first moment of the mean biasing function, b{M), 
for galaxies with -21 < °'^Mr - 5 log ft < -20, etc. The 
solid curves in Fig. [l] show the results for our fiducial model 
(o-c = 0.14; Qs = -1.2; 77 = 1.0 and /? = 1.0), while the 
different columns show models in which we vary only one of 
the CLF parameters, as indicated. The parameter r] in the 
third column is defined by 



-I>(L|Af) = $c(LlAf)/j7 + 'l's(LIAf) , 



(33) 
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Figure 1. The dependence of galaxy bias on halo occupation statistics. From top to bottom, the various panels plot b, the non-linearity 
parameter b/b, the cross correlation coefficient r, and the stochasticity parameter log((T^) always for galaxies with luminosities in the 
range [Li,2.5Li]. Results are plotted as function of Li, expressed as an r-band magnitude that has been K+E corrected to 2 = 0.1. 
From left to right, we each time vary only one parameter (listed at the top) with respect to our fiducial model, which is indicated by the 
black, solid line. The horizontal, dotted line corresponds to linear, deterministic biasing. See eq.|(29j, II30D .I I33I1 . and II25D for the definition 
of (T(,,as, ?j, and /3, respectively. 



and is used to artificially reduce the contribution of centrals, 
simply by increasing 77 with respect to its fiducial value. 
To guide the discussion, Fig. [2] shows the halo occupation 
distributions (HODs) corresponding to the various models 
shown in Fig. [T] Note how increasing CTc broadens the con- 
tribution of centrals, how increasing Oc reduces the slope 



dlog{7V|M)/dlogM, and how increasing -q suppresses the 
contribution of centrals. 



Starting with the upper panels of Figure [T] we see that 
the fiducial model predicts a h{L\) that strongly increases 
with luminosity. In order to understand this behavior, we 
use Eqs. ((6]) and ([9]) to write 
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Figure 2. The halo occupation statistics, (A'^|Af), as function of halo mass, M, for three different magnitude bins (different columns, as 
indicated at the top). From top to bottom we vary (Tc, Os and -q with respect to our fiducial model, which is always indicated as a black, 
solid curve. These are the same HODs as used in Fig. [T] Note how increasing (Tc broadens the contribution of centrals, how increasing 
Oc reduces the slope dlog(Af|Af)/dlog Af , and how increasing t] suppresses the contribution of centrals. 



p^ J {N\M) M n{M) dM 
n^ / M2 n{M) dM 



(34) 



Hence, 6 = 1 if {N\M) = (rig/pm) M, which corresponds to 
b{M) = 1. As evident from Fig. El for brighter bins, {N\M) 
cuts off at higher M (i.e., bright galaxies can only reside in 
massive haloes), and this cut-ofF causes 6 > 1. For fainter 
galaxies, the contribution of central galaxies in the HOD 
becomes more pronounced, resulting in a 'boost' of {N\M) 
at relatively low M (see left-hand panels of Fig. [5}, which 
in turn results in & < 1. This is also evident from the scaling 
with rj: increasing rj suppresses the relative contribution of 
centrals, which in turn causes an increase in b. Changing the 
Poisson parameter only changes the scatter in the number 
of satellites, and therefore has no impact on b (or b), while 
changes in a^ or Os has only a mild impact on b for reasons 
that are easily understood from an examination of Eq. (|34|) 
and Fig. [2] The main message here is that realistic HODs 



differ strongly from the simple scaling {N\M) ex M, such 
that 6 < 1 (6 > 1) for faint (bright) galaxies. 

The second row of panels in Fig. [T] shows that the 'nor- 
malized' non-linearity parameter b/b also differs markedly 
from unity for our fiducial model. As for b, this parame- 
ter is also equal to unity if bias is linear (i.e., if {N\M) — 
(fig/pm) M). Since this is not the case for realistic HODs, 
both b and b will in general differ from unity. At the faint- 
end, b/b is extremely sensitive to the parameter Qg. This is 
easy to understand; as is evident from Fig. [2] the parameter 
Us controls the slope of {N\M) at the massive end, espe- 
cially for fainter galaxies, for which the satellite fraction is 
larger. Models for which the slope dlog{Ai'|M)/dlog M de- 
viates more from unity are more non-linear. In other words, 
b/b is simply a measure for how much {N\AI) differs from 
the linear relation {N\M) oc M. 

The third row of panels in Fig. [T] shows the cross- 
correlation coefficient r. In all cases shown, and for all lumi- 
nosities, the CLF indicates that r < 1. Writing that 
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_ h 
^ ~ b 



1 + 



(f)' 



-1-1/2 



(35) 



we immediately see that r < b/b (where the equahty corre- 
sponds to deterministic biasing). Hence, the fact that r < 1 
simply reflects the fact that, for realistic halo occupation 
statistics, the non-linearity parameter b/b > 1. The decline 
of r at the bright-end is a reflection of stochasticity becom- 
ing more and more important for brighter galaxies. This is 
evident from the bottom panels of Fig. [T] which show that 
(7b increases very strongly with luminosity. In order to un- 
derstand this behavior, it is useful to rewrite Eq. 1)13^ using 
Eqs. ([26l)-(l27|) as 



o"b = 



Ml 



— +AP) 



1/2 



where 



Mi 



M 



n{M) dM 



1/2 



(36) 



(37) 



is some constant, we have assumed that A^c and A's are in- 
dependent random variables, and 

{iVs|M)2 



AP) 



(/?-l)/s 



■ n(M) dM ■ 



ft 



{N^\Mf 



n{M) dM . 



(38) 



Here /c = fie /fig and /s — fis/fig = 1 — /c are the central 
and satellite fractions, respectively, and the average number 
densities fig, fic and fis follow from 



nx= {N^\M}n{M)dM, 



(39) 



where 'x' stands for 'g' (for galaxies), 'c' (for centrals) or 
's' (for satellites). The first term of Eq. (|36p indicates the 
contribution to ah due to shot-noise, i.e., the finite number of 
galaxies (per halo). This term dominates when the number 
density of galaxies is small (i.e., for bright galaxies), in which 

— — 1/2 

case ah oc rig .It is this shot-noise that is responsible for 
driving r — > at the bright end. The second term of Eq. (|36p 
describes the contribution to ah due to specific aspects of the 
halo occupation statistics, as described by the function T{P). 
This function, which is independent of fig, consists of two 
terms describing the contributions due to the possible non- 
Poissonian nature of satellite galaxies (i.e., if /3 7^ 1) and due 
to scatter in the occupation statistics of centrals (i.e., a non- 
zero (Jc). Note that the first term of I is zero for our fiducial 
model, which has /3 = 1. This explains the insensitivitj|j to 
changes in Qs. Changes in CTc only have a very mild impact 
on the stochasticity, while increasing (decreasing) /3 only has 
a significant impact for faint galaxies, simply because the 
l/flg shot-noise term dominates for bright galaxies. Finally, 
the increase of o"b with increasing r) is simply a reflection of 
the fact that an increase in r) reduces the number density of 
(central) galaxies. 



^ Since the number density of galaxies is dominated by centrals, 
changes in the number density of satellites due to changes in as 
have almost no impact on ah- 



To summarize, realistic halo occupation models predict 
a galaxy bias that is strongly non-linear, indicating that re- 
alistic models do not scale as l^N\M) oc M. This is mainly a 
consequence of central galaxies, which dominate the number 
density and for which {Ne\M) never resembles a power-law. 
However, even for satellite galaxies it is important to re- 
alize that (A'sIM) never follows a pure power law; even if 
(A'^s|M) oc M at the massive end, there will be a cut-off 
at low M, reflecting that galaxies of a given luminosity (or 
stellar mass) require a minimum mass for their host halo. 
Such a cut-off by itself is already sufficient to cause b and b 
to deviate significantly from unity. As for the stochasticity; 
this is largely dominated by shot-noise, with halo-to-halo 
scatter, which reflects the second moment of the halo occu- 
pation distribution P{N\M), only contributing signiflcantly 
for fainter galaxies. 



4 TWO-POINT STATISTICS 

The bias parameters defined in ^ are quantities that are 
averaged over dark matter haloes. Given that (virtually) all 
galaxies are believed to reside in haloes, this is a logical and 
intuitive way to define galaxy bias. However, observation- 
ally it is far from trivial to actually measure these quantities 
from data. After all, this requires an observational method 
to identify individual dark matter haloes. In principle, this 
can be achieved using gravitational lensing, but in practice 
this is only possible for massive clusters. An alternative is to 
use a ha lo-based gal a xy gro up finder, such as the one devel- 
oped bv lYang et al.l (l2005al ). However, this method has the 
disadvantage that it uses galaxies to identify the dark matter 
haloes and estimate their masses. Consequently, A'' and M 
are not independent variables, which is likely to cause sys- 
tematic errors. For example, if TV is in one way or the other 
used to estimate M, the halo-to-halo variance in P{N\M), 
and hence the amount of stochasticity, will be underesti- 
mated. 

Therefore, when the goal is to put constraints on galaxy 
bias using observational data, one requires another set of 
bias parameters that do not suffer from these shortcomings. 
Such a set can be defined using two-point statistics, such as 
the galaxy-galaxy and galaxy-matter correlation functions, 
which can be reliably measured from large galaxy surveys 
such as the SDSS. In addition, with the help of the 'halo 
model', which describes the dark matter density field in 
term s of its halo build ing blocks (see e.g.. iCoorav fc ShethI 
12003 : IMo et al.l l2010f ). one can analytically compute the 
galaxy-galaxy correlation function from the same occupa- 
tion statistics, P{N\M), required to compute b, b, &var, r 
and ah- 

Let us define the following two-point correlation func- 
tions: 

Cgg(r) = (5g(x)5g(x + r)) 

^mm(r) = ((5,n(x)5m(x+ r)) 

?gm(r) = (5g(x)5„,(x + r)), (40) 

where (...) represents an ensemble average, r = |r| is the dis- 
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tance between the two locationQ, and the subscripts 'gg', 
'gm', 'mm' refer to 'galaxy-galaxy', 'galaxy-matter', and 
'matter-matter', respectively. 

Using these two-point correlation functions, we now de- 
fine three functions that are sensitive to different aspects of 
galaxy bias: the 'classical' galaxy bias function, 



bTir) 



Cgg(r-) 

Cmm(r) 



(41) 



the galaxy-dark matter cross-correlation coefficient (here- 
after CCC), 



Cgm(r) 



'"^^"KggM*enn.(r)]V2' 

l|Penll 19981 ). and their ratio, 



rgm('-) 



bfir) _ egg(r) 



7^iS(r■) Cgm(r) 



(42) 



(43) 



The reason for introducing FgJ^ is that, contrary to 6gJ^ and 
7?.gJ^ it is independent of the matter- matter correlation func- 
tion, which makes it easier to measure observationally (see 
ISheldon et al.ll2004l ). In what follows we shall loosely refer to 
these functions as 'bias functions', and to their r-dependence 
as 'scale-dependence'. 

Using the ergodic principle, the ensemble average, {...), 
can be written as a volume average, which in turn, under the 
assumption that all dark matter is bound in virialized dark 
matter haloes, is equal to an average over dark matter haloes 
fsee lMo et al.ll201Cl ). Hence, similar to the bias parameters 
defined in gS] the bias functions &g°(r), 7^^m(r), and r™(r) 
are also 'defined' as halo averaged quantities. The advantage 
of defining bias functions via two-point statistics, however, 
is that they can be measured without the need to identify 
individual dark matter haloes. Finally, we emphasize that 
the CCC is not restricted to |7?.(r)| < 1 when computed with 
the analytical halo model because it intrinsically subtracts 
out the shot- noise term in the galaxy correlation (see e.g., 
ISheth fc Leinson 1999 : Seliak 2000 ). 



4.1 Analytical Model 

As we shall see below, all these three bias functions can 
be computed analytically from the halo occupation distri- 
bution P{N\M), making them the natural extension of the 
bias parameters defined in § [2] to the two-point regime. We 
address the observational perspective of these bias functions 
in ^ In this section we focus on investigating what realistic 
models for halo occupation statistics predict for (the scale 
dependence of) 6™, 7^g° and T™. 

In what follows we describe the two-point statistics in 
Fourier space, using power-spectra rather than two-point 
correlation functions. This has the single advantage that 
equations that involve convolutions in real-space can now 
be written in a more compact form. Throughout we assume 
that dark matter haloes are spherically symmetric and have 
a density profile, pm(r|M) = M Uh{r\M), that depends only 
on their mass, A{. Note that J Uh(xjM, 2:) d'^x — 1. Simi- 
larly, we assume that satellite galaxies in haloes of mass M 

* Here we have made the standard assumption that the Universe 
is isotropic. 



follow a spherical number density distribution ns(rjAf) — 
NsUs{r\M), while central galaxies always have r — 0. Since 
centrals and satellites are distributed differently, we write 
the galaxy-galaxy power spectrum as 



Pgg(fc) = f^Pccik) + 2/c/.Pc.(fc) + f^Pssik) , 



(44) 



while the galaxy-dark matter cross power spectrum is given 
by 



-fgm(,n^j — ./c-fcm(ri;j + /s-t sm(,ri;j . 



(45) 



In addition, it is common practice to split two-point statis- 
tics into a 1-halo term (both points are located in the same 
halo) and a 2-halo ter m (the two points are lo cate d in differ- 
enthaloes). Following [Coorav fc Shethll2002l and IMo et all 
I2OIO', the 1-halo terms are 



nc 



Pss\k) = 13 Hl{M) n(M) AM , 
and all other terms are given by 
■Pxy (fc) = / HAM) Hy{M) n{M) dM . 



(46) 
(47) 

(48) 



Here 'x' and 'y' are either 'c' (for central), 's' (for satellite), 
or 'm' (for matter), and we have defined 



Hm{M) = — Ul,{k\M) , 



HciM) 



and 



(A^clM) 



H.(M) = M^fi.(fc|M)^ 



(49) 
(50) 

(51) 



with uii{k\AI) and Us{k\AI) the Fourier transforms of the 
halo density profile and the satellite number density profile, 
respectively, both normalized to unity. The various 2-halo 
terms are given by 



P^yik) = Piin(fc) / dMi •Hx(Afi) 6h(Afi) n(Mi) 



dAf2Wy(M2)6h(M2)n(A/2 



(52) 



where Piin(fe) is the linear p ower spectrum and bh{M,z) is 
the halo bias function (e.g., IMo fcWhit3 119961 ). Note that 
in this formalism, the matter-matter power spectrum simply 

reads -rinm('^) ^ Pmmyk) + PmrnV^j ■ 

The two-point correlation functions corresponding to 
these power-spectra are obtained by simple Fourier trans- 
formation: 



^xy(r) 



2-^/0 '"^ 



_ , , , sin kr , 2 , , 



kr 



(53) 



Throughout this paper we adopt the halo mass functions and 
halo b ias functions of Tinker et al. (20081) and [Tinker et al.l 
(J2OIOI ), respectively. 

We caution the reader that this particular implementa- 
tion of the halo model is fairly simplified. In particular, it 
ignores two important effects: scale dependence of the halo 
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Figure 3. Scale dependence of the galaxy bias functions bi^ , 7^-gm, and FgJ^ for tliree luminosity bins (indicated at the top of every 
column). The reference model is indicated with the black solid lines, whereas other lines refer to a suppression of the central term of the 
HOD by a factor I/77 (see eq. | 33| and discussion in §3). 



bias function and halo-exclusion (i.e., the fact that the spa- 
tial distribution of dark matter haloes is mutually exclu- 
sive). As discussed in Tinker et al. (2005), both effects are 
important on intermediate scales (in the 1-halo to 2-halo 
transition region). Indeed, demonstrated in van den Bosch 
et al. (2012, in preparation), ignoring these effects can cause 
systematic errors in the two-point correlation functions on 
scales of ~ 1 — 2h~^ Mpc that are as large as 50 percent. 
However, detailed tests have shown that the bias functions 
(|4ip - (|43p . which are defined in terms of ratios of these corre- 
lation functions, are much more accurate (with typical errors 
<, 10 percent); i.e., to first order, by taking ratios, one is less 
sensitive to uncertainties in the halo model. This is why it 
is advantageous to use the bias functions, rather than the 
two-point correlation functions themselves, when trying to 
constrain particular aspects of galaxy bias. It is the purpose 
of this paper to explore how the (scale-dependence) of the 
bias functions depend on various properties related to halo 
occupation statistics. 

Before showing predictions based on a specific HOD, we 



can gain some insight from a closer examination of the above 
equations. In particular, it can easily be seen that galaxies 
are only unbiased with respect to the dark matter, at all 
scales (i.e., 6g° — 1), if and only if the following conditions 
are satisfied 

(i) ri — 00, i.e., there are no central gala^cies 

(ii) 13 = 1, i.e., the occupation number of satellite galaxies 
obeys Poisson statistics 

(iii) Us(k\M) = Uh(k\M), i.e., the normalized number 
density profile of satellite galaxies in dark matter haloes is 
identical to that of dark matter particles 

(iv) (A's|M) = — Af, i.e., the occupation number of satel- 
lites is directly proportional to halo mass 

Under these conditions one also has that 7?.g° = FgJJ^ = 1. 
If all the above conditions are satisfied except that there 
is a non-negligble fraction of centrals (i.e., r) is finite), one 
expects strong scale-dependence on small scales due to the 
fact that the location of central galaxies within their dark 
matter haloes is strongly biased. If /3 7^ 1, one still main- 
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Figure 4. Scale dependence of the galaxy bias functions bg, 7?.gm, and Fgni for three luminosity bins (indicated at the top of every 
column). The reference model is indicated with the black solid lines, whereas other lines refer to a suppression of the central term of the 
HOD by a factor l/rj (see discussion in §3). 



tains b^ = 1 on large scales (r ^ ''12), but this will transit 
to b^ = j3 for r <^ ri2 (if r) = 00). Here ri2 is the 1-halo 
to 2-halo transition region, which is roughly equal to the 
virial radius of the characteristic halo hosting the galaxies 
in question. This comes about because the Poisson param- 
eter /3 only enters in the 1-halo satellite-satellite term. If 
Us(fc|M) 7^ u-h_{k\M), once again this will manifest itself as 
a transition from fog° = 1 at r 3> ri2 to 6g° 7^ 1 at r ^ ri2 
simply because Us{k\M) — Uh(k\M) = 1 on large scales (i.e., 
for k ^ l/'"i2). If (A'slA'f) is not proportional to M, which 
as we have seen in ^ is never expected to be the case for 
realistic HODs, one also expects a transition around ri2. 
However, in this case b'^ is not expected to be equal to 
unity at either small or large scales. The exact values of feg° 
depend on how exactly the satellite occupation numbers de- 
viate from linearity, but will be different on small and large 
scales mainly because the 2-halo term weights {A's|Af) with 
the halo bias feh(A^), whereas the 1-halo term doesn't. In 
fact, on large scales, where Us{k\M) = £(h(fc|M) — 1 and 



the matter power spectrum is still in the linear regime, it is 
straightforward to show that 






{Mb(M)&h(M)) 

(M) 



(54) 



where (...) is the average over dark matter haloes given by 
Eq. (|10p and b{M) is the mean biasing function of Eq. (|6]). 
This immediately shows that the large scale bias bg de- 
fined via the correlation functions cannot, in general, be 
expressed in terms of the moments b and 6 of b{AI) (cf. 
Eq. [S]). Only in the case of linear biasing we have that 
b'^ = b(M) = 6 = 6 = 1. To summarize, based on 
all these considerations, one expects scale independence on 
large scales, at a value that depends on the details of the 
HOD, a sudden transition at around the 1-halo to 2-halo 
transition scale, and scale dependence on small scales due 
to the dominance of central galaxies. Finally, it is worth 
pointing out that on the large scales where Eq. (|54|) is valid, 



one always expects that 7?.g 



1 and rl° 
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Figure 3 shows the scale dependence of the biasing func- 
tions defined in Eqs. ((4T|) - (|43)) computed using the ana- 
lytic model outlined above and with the same halo occu- 
pation statistics as in |3] The three columns refer to three 
luminosity bins (expressed in r-band magnitude). Beside 
the reference model (black solid lines), we show four ad- 
ditional model predictions in which the contribution from 
central galaxies is increasingly suppressed as the parame- 
ter 77 changes from unity to 100 (see discusssion in §3). As 
is evident from the upper panels, the galaxy bias, b^ , ex- 
actly reveals the behaviour expected based on the discussion 
above: on large scales the bias is scale-independent, there 
is sudden transition around the 1-halo to 2-halo transition 
scale (which is larger for brighter galaxies, since these reside 
in more massive, and therefore more extended haloes), and 
there is significant scale dependence on small scales. Note 
also that the large-scale bias is larger fo r brighter galaxies . 
This is consistent wi t h obs e rvations (e.g. 



Norberg et all I2001I. I2OO2I : IZehavi et at 



20071 : 



Zehavi et al. 2011 



Guzzo et al.ll2000l: 



20051 : IWang et all 



and is a manifestation of the fact 
that brighter galaxies reside in more m assive haloes, which 
are more strongly clustered (|Mo &: W hite 1996). The sup- 
pression of the contribution from central galaxies (i.e., in- 
creasing rf) has the effect of increasing the value of the bias 
on large scales and it also affects the scale dependence of the 
bias on small scales. The effect is larger for brighter galaxies 
which is a direct consequence of the fact that, for realis- 
tic HODs the fraction of galaxies whic h are centrals is an 
increasing function of luminosity (e.g., Mandelba um et al.l 
I2OO6I : Ivan den Bosch et aLllioOTJ : ICacciato et al. 20(3)^ 

As expected, the CCC, TZ^m, shown in the panels in the 
middle row, is unity on large scales for all three luminosity 
bins, and independent of the value of ry. On small scales, 
however, 7?.g° > 1. This scale dependence is mainly due 
to the central galaxies being located at the centers of their 
dark matter haloes. Indeed, suppressing the contribution of 
central galaxies (i.e., increasing 77) results in a CCC that is 
closer to unity on small scales. Overall, the scale dependence 
of 7?.g° is more pronounced for brighter galaxies. This is a 
consequence of the fact that brighter galaxies reside in more 
massive, and therefore more extended, haloes. Note that our 
model, which is based on a realistic HOD that is in excellent 
agreement with a wide range of data, predicts that for galax- 
ies with magnitudes in the range —18 >°'^ Af,. — 51og/i > 
-19 and -20 >"-^ M^ - 5 log /i > -21 the CCC is very close 
to unity on scales above 



0.2/i~' Mpc and 



0.4ft- 



Mpc, respectively. As we will see below, this is actually a 
very robust prediction, and indicates that if suppression of 
scale-dependence is important, it is in general advantageous 
to use fainter galaxies (but see i]6.2p . For the brightest bin, 
our model predicts that simply reducing the contribution 
from the central galaxies does not lead to 7?.™ — 1 over the 
scale probed here. This is due to the fact that a realistic 
HOD does not have (A'a|M) ex M and even more important 
bright satellite galaxies only form above a large halo mass. 
We have tested that artificially imposing (A'^sjM) oc M and 
no low-mass cut off yields 7?.g° = 1 at all scales of interested 
here also in this brightest bin. 

Finally, because of its definition, the scale dependence 
of the bias function FgJ^, shown in the lower panels, is eas- 
ily understood from a combination of the effects on both 
6gm and 7?.|ni- Our fiducial model predicts that FgJ^ is scale- 



independent on large scales, and decreases with decreasing 
radius on small scales where the 1-halo term of the two- 
point correlation functions dominates. Overall, the scale- 
dependence of FgJ^ is predicted to be larger for brighter 
galaxies. 

The results in Fig. 3 indicate that our analytical model, 
which is based on a realistic HOD, makes some very specific 
predictions regarding the scale dependence of the bias func- 
tions (|41[) -(|43 | ). However, before we embark on a detailed 
study of how these predictions depend on some specific as- 
pects of the halo occupation model used, it is important to 
stress that the results in Fig. 3 are in real-space. Unfortu- 
nately, because of redshift-space distortions and projection 
effects, real-space correlation functions are extremely diffi- 
cult, if not impossible, to obtain observationally. We there- 
fore first recast our bias functions into two-dimensional, pro- 
jected analogues, which are more easily accessible observa- 
tionally. We start by defining the matter-matter, galaxy- 
matter, and galaxy-galaxy projected surface densities as 



Exy(rp) = 2p / [l + ?xy(r)] 



rdr 



Vr 



(55) 



where 'x' and 'y' stand either for 'g' or 'm', and r-p is the 
projected separation. We also define Exy(< r^) as its average 
inside r^, i.e. 



2xy(<rp) 



2 
'P JO 



ljxy(-ft )R dR , 



which we use to define the excess surface densities 



AExy(rp) 



K^-p 



(56) 



(57) 



These can subsequently be used to define the projected, 2D 
analogues of Eqs. (|^ - (P5)l as 



hirp) 



/Vgm(7'pj 



and 



1 gm(''p) 



^Sgg(''p) 



AE„ 



AEgn,(rp) 



^AEgg(rp)AEn,„,(r 



6g(rp) _ AEgg(rp) 



7^gm(rp) AEgm(rp) 



(58) 



(59) 



(60) 



In what follows we shall refer to these as the 'projected bias 
functions'. 

Note that in the case of the galaxy-dark matter 
cross correlation, the excess surface density AEgin(»'p) = 
7t(rp)Ecrit, where 7t(fp) is the tangential shear which can 
be measured observationally using galaxy-galaxy lensing, 
and Ecrit is a geometrical parameter that depends on the 
comoving distances of the sources and lenses. In the case of 
the galaxy-galaxy autocorrelation we can write that 



ASgg(rp) = p 



— / Wp{R') R' dR' - Wp{rp) 



(61) 



from which it is immediately clear that Agg(rp) is straight- 
forwardly obtained from the projected correlation function 
'Wp{rp), which is routinely measured in large galaxy redshift 
surveys. Finally, in the case of the matter-matter autocor- 
relation, the quantity AEinm(7'p) can be obtained observa- 
tionally if accurate cosmic shear measurements are avail- 
able. Since the cosmic shear measurements are the most 
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challenging, the parameter T^ui{rp), which is independent 
of AEmm(?'p), is significantly easier to determine observa- 
tionally than either &g(rp) and/or 7?.gm(rp). In fact, current 
clustering and galaxy-galaxy lensing data from the SDSS is 
already of sufficient quality to allow for reliable measure- 
ments of rgin(''p) for different luminosity bins. The purpose 
of this paper, however, is not to perform such measurements, 
but rather to provide theoretical guidance on how to con- 
strain different aspects of galaxy biasing by exploiting the 
wealth of information encoded in the scale dependence of 
the (projected) galaxy bias functions. 



5 IMPACT OF HALO OCCUPATION 

ASSUMPTIONS ON GALAXY BIASING 

We now investigate how modifications of the halo occupation 
statistics, modelled via the CLF, impact the projected bias 
functions 6g(r), TZgni{r) and rgm(r) defined in Eqs. ((58)) - 
(|60p . We first study modifications regarding the way central 
galaxies occupy dark matter haloes, followed by an indepth 
study of the impact of various aspects of satellite occupation 
statistics. 



5.1 The Relative Contribution of Centrals 

We start by exploring the role of central galaxies by compar- 
ing models in which the relative contribution of centrals is 
progressively suppressed via the parameter rj (see ^. Figure 
4 shows the scale dependence of the projected bias functions 
6g, 7?.gm, Tgni for the same luminosity bins as in Figure 3. The 
bias functions are plotted as a function of the projected ra- 
dius, Tp, covering the range [0.1, 30]h~^ Mpc. Results are 
shown for the reference model {rj = 1, black solid lines) and 
for models with an increasingly lower contribution from cen- 
trals {rj = 2, 5, 10, 100, as indicated). Overall, the trends are 
very similar to those seen in Fig. 3. For instance, the ref- 
erence model once again shows that the magnitude of scale 
dependence increases with luminosity. However, upon closer 
examination some important differences become apparent, 
which arise from projecting and integrating the two-point 
correlation functions, which are the operations performed 
in order to compute the excess surface densities given by 
Eq. [57] • An important difference is that the galaxy bias, 
&g, remains scale dependent out to much larger radii; in the 
highest luminosity bin, significant scale dependence remains 
visible out to ~ 20h~^ Mpc. This is very different from the 
case of 6g° which becomes scale- independent at ~ 5/i~^Mpc, 
independent of the value of rj. This difference is a conse- 
quence of the integration ((56]), which mixes in signal from 
small scales. This mixing also smears out the sharp features 
in the 1-halo to 2-halo transition region that are present 
in their 3D analogues. Hence, although the projected bias 
functions have the advantage that they are observationally 
accessible, their interpretation is less straightforward. Nev- 
ertheless, as we will see below, different aspects of the halo 
occupation statistics impact the projected bias functions in 
sufficiently different ways that they still provide valuable 
insight into the nature of galaxy biasing. 



5.2 Scatter in the Luminosity-Halo Mass Relation 

Throughout this paper, the number of central galaxies with 
luminosity L that reside in a halo of mass M is modelled 
as a log-normal distribution (see Eq. [29] ). whose standard 
deviation, CTc, indicat es the scatter in lumi nosity at a given 
halo mass. Follo wing Cacciato et al.[ ((20091 ) . and motivated 
by the results of (More et al.l ( 2009b|) based on satellite kine- 



matics, we assume t hat ac is independent of halo mass. 
(Cacciato et al.l (2009; obtained ctc — 0.14, which is the value 
we adopt in our fiducial reference model. 

Figure (5[ shows the projected galaxy bias functions for 
the reference model (ctc — 0.14, black solid lines) and for 
two models with tJc — 0.17 (red dotted lines) and ctc — 0.11 
(blue dashed lines), respectively. All other parameters are 
kept unchanged. Only the brightest bin reveals appreciable 
changes in the bieis functions. This is most easily understood 
by examining the upper panels of Fig. (2] which show how 
changes in CTc impact the occupation statistics. For the two 
fainter bins, changes in (Jc only have a very mild impact 
on the HODs. This in turn is a consequence of the fact that 
fainter galaxies reside, on average, in less massive haloes, and 
therefore probe the low mass end of the halo mass function. 
In this regime, the halo mass function is a power law and 
variations in the scatter in <l>c(I/|Af) have little effect on the 
resulting mass of the average halo hosting these galaxies. 
Conversely, brighter galaxies probe the high mass end of the 
halo mass function, which reveals an exponential decline. 
Thus, variations in the scatter in ^c{L\M) strongly affect 
the resulting mass of the average halo hosting these galaxies 
(see upper right-hand panel of Fig. (2}, which reflects itself 
in a change of bg. In particular, increasing (decreasing) a^ 
strongly suppresses (boosts) the bias bg{r). 

An interesting aspect of scatter (i.e., stochasticity) in 
the occupation statistics of central galaxies, is that it im- 
pacts the 2-halo term, i.e., changes in Uc have an impact on 
6g and Fgni, but not 7?.gm, on large scales. This is a conse- 
quence of the fact that changes in ac impact {Nc\M) (see 
discussion in il2.3p . Note that this is not the case for the 
stochasticity of satellite galaxies, which only impacts the 
bias functions on small scales (where the 1-halo term domi- 
nates) . 



5.3 The First Moment of P{Ns\M) 

The first moment, {iVs|M), of P{Ns\M), for any luminosity 
bin [Li,L2], is completely specified by the CLF (see § (3}. 
As shown in Fig. (5J the slope of {Ns\M) is very sensitive to 
changes in the parameter as, which controls the faint-end 
slope of $s{L\M): in general, smaller (i.e., more negative) 
values of Us result in steeper {A's|Af). As such, a^ is a pa- 
rameter that controls the non-linearity of the HOD. Data 
from clusters and galaxy groups typicall y indicate values for 



Qs in the range —1.5 < Os ^ ~9-^ (e.g..(Beiiersbergen et al 
|2002l : (Trentham fc TullvJlJOO^ : (Eke et al.l(2004( : (Yang et al 
2008) 

Figure (S( shows the projected bias functions for the ref- 
erence model (which has as ~ —1.2) as well as for two vari- 
ations with as = —0.8 and —1.6, as indicated. All other 
model parameters are kept unchanged. Note how smaller 
values of as increase (decrease) the bias parameter &g for 
faint (bright) galaxies. For faint galaxies, this is easy to un- 
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Figure 5. Same as in Fig. (4] The reference model is indicated with the black solid lines, whereas other lines refer to models with larger 
(cc = 0.17, red dotted lines) or smaller (ctc = 0.11, blue dashed lines) scatter in the luminosity-halo mass relation (see eq. | 29| and 
discussion in §5.2). 



derstand; a more negative Os results in a steeper (A'sIM), 
which implies that satellites, on average, reside in more mas- 
sive (i.e., more strongly biased) haloes. Since the satellite 
fraction decreases with increasing luminosity, the impact of 
changes in as become smaller for brighter galaxies. 



5.4 The Second Moment of P{Ns\M) 

The CLF does not specify the second moment, {N'^\M), of 
the satellite occupation distribution. Rather, it is often as- 
sumed that P{Nb\M) follows a Poisson distribution 



P{N,\M) = 



A^° exp [-A] 

iVs! 



(62) 



where A = {Ns\M) is the first moment of the distri- 
bution. Recall that for a Poisson distribution all mo- 
ments can be derived from the first moment. In par- 
ticular, {Ns{Ns - l)jM) = {Ns\Mf. The assumption 
that P{Ns\M) is (close to) Poissonian has strong support 



from galaxy group catalogs (e.g., lYang et al.l 120081 ) and 
from numerical simulations, which show that dark mat- 
ter subhaloes (which are belie ved to host sate l lite g alajcies) 
also follow Poisson statistics (jKravtsov et al.ll2004l ). How- 
ever, a number of studies have shown that there may be 
small but significant deviations from pure Poisson statisitcs 



(e.g.. [ Porcia ni et al. 2004; van den Bosch e t al. 2005a 



Giocoh et allboiol : iBovlan-Kolchin et al.ll2O10l : iBusha et al] 
201H ). Hence, we investigate how deviations from Poisson, 
as parameterized via the parameter /3 (see eq. 25), impact 
on the (projected) bias functions. 

Figure [7] shows the projected bias functions for the ref- 
erence model {j3 — 1, corresponding to P{Ns\M) being Pois- 
sonian) as well as for two variations with /3 — 0.5 and 1.5, as 
indicated. All other model parameters are kept unchanged. 
The parameter /? only affects the 1-halo satellite-satellite 
term of the galaxy-galaxy correlation function. This term 
typically has a maximum contribution close to the 1-halo to 
2-halo transition region, which is therefore the radial interval 
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Figure 6. Same as in Fig. [4] The reference model is indicated with the black sohd lines, whereas other lines refer to models with lower 
(as = —1.6, magenta dashed lines) or higher (og = —0.8, blue dot-dashed lines) value of the low mass end power-law index of the CLF 
(see eq.[30] and discussion in §5.3). 



that is most sensitive to changes in /3. Since brighter galax- 
ies reside in more massive (and therefore more extended) 
haloes, changes in /3 impact the (projected) bias functions 
on larger scales for brighter galaxies. Also, since the satel- 
lite fraction increases with decreasing luminosity, the impact 
of changes in /3 is larger for less luminous galaxies. Overall, 
though, changes in /3 of 50 percent only have a fairly modest 
impact on the (projected) bias functions. Since /3 is unlikely 
to differ from unity by more than ~ 20 percent, we conclude 
that potential deviations from Poisson statistics are unlikely 
to have a significant effect on galaxy biasing. 



5.5 The Spatial Distribution of Satellites 

In our fiducial reference model it is assumed that the number 
density distribution of satellites within dark matter haloes 
is identical to that of dark matter particles; i.e., we as- 
sume that Us{k\A{) — Uh{k\M). The dark matter density 



profiles are modelled as NFW (jNavarro et al.l 119971 ) pro- 
files, with a c oncen tration mass relation, Ch(M), given by 
iMaccio et al.l (|2007l ). Whether the assumption that the num- 
ber density distribution of satellite galaxies is well described 
by the same NFW profile, and with the same concentration- 
mass relation, is still unclear. In particular, n umerous studies 
have c ome up with co n flictin g claims (e.g.. iBeers fc Tonr 
198 61: ICarlberg et all 1 19971: Iron der Marel et al.l |20~ 



r3 



Lin et al.' '2004; van den Bosch et al.' '2005b; Yang et ajj 
2005b: Chen 2009: More et al. 2009a : Nierenberg et al. 20lll : 



Watson et al.ll201ll : lGuo et al.ll2012l ). We therefore examine 
the impact of changing the number density profiles of satel- 
lites, which we parameterize via 



/cone — Cs/Ch , 



(63) 



where Cs is the concentration parameter of the satellite num- 
ber density profile. Note that /cone = 1 for our fiducial refer- 
ence model. Figure [8] shows how changes in /cone impact the 
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Figure 7. Same as in Fig. (4] The reference model is indicated with the black solid lines, whereas other lines refer to models with 
(/9 = 1.5, red lines) or smaller (/3 = 0.5, blue lines) Poisson parameter (see eq. [25] and discussion in §5.4). 



larger 



projected bias functions. The various curves correspond to 
/cone ~ 0.5, 1.0 and 2.0, as indicated. All other model param- 
eters are the same as for the reference model. As expected, 
changing the radial number density profile of satellite galax- 
ies only affects the bias functions on small scales where the 
1-halo term dominates. In general, a more centrally concen- 
trated distribution of satellites (i.e., larger /cone) results in a 
larger galaxy bias and smaller CCC on small scales. Since the 
impact of /cone is restricted to smaller scales than most other 
changes in the halo occupation statistics, accurate measure- 
ments of the (projected) bias functions on small scales holds 
excellent potential for constraining the radial number den- 
sity profiles of satellite galaxies. 



6 OBSERVATIONAL PERSPECTIVE 

6.1 Probing the Scale Dependence of Galaxy Bias 

Constraining the non-linearity and stochasticity of galaxy 
bias with observations is a non-trivial task. The main rea- 
son is that it requires measurements of the fiuctuations in 
the matter density (or n-point statistics thereof), which 
are difficult to obtain. In the absence of such informa- 
tion, however, one can still make some progress. For ex- 
ample, one can test for linearity by measuring higher- 
order statistics of the galaxy distribution, s uch as the bi- 
spectrum (e.g.. [Frieman fc Gaztanagalll999l : ISzapudi et al.l 
l2002l : IVerde et al.ll2002l ). This method, though, yields no in- 
formation regarding stochasticity in the relation between 
galaxi es and matter. Alternatively, iTegmark fc BromlevI 
l|l999f) suggested a method to constrain the non-linearity 
and/or stochasticity by comparing different samples of 
galaxies (i.e., different luminosity bins, early-types vs. late- 
types, etc.) Quoting directly from their paper: "If two dif- 
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Figure 8. Same as in Fig. |4] The reference model is indicated with the black solid lines, whereas other lines refer to models with 
higher (/cone = 2, red dotted lines) or lower (/cone = 0.5, blue dot-dashed lines) value of the ratio between dark matter and galaxy 
concentration, /cone = Cm/cg. (see eq. |63l and discussion in §5.5). 



ferent types of galaxies are both perfectly correlated with 
the matter, they must also be perfectly correlated with each 
other". Hence, if one can establish that the correlation be- 
tween the two samples is imperfect, than galaxy bias has 
to be non-linear and/or stochastic for at least one of the 
two samples. Different implementations of this idea have 
been used by a number of authors ( e .g., Blanto: 



Conway et all |2005|; IWild et al.l l2005l: IWang et all 



200C 



2007 



Swanson et al.l 120081 : IZehavi et al.l |201l|) , with the general 
result that bias cannot be linear and deterministic. 

A third method to constrain the non-linearity of galaxy 
bias without dir ect measurements o f the matter fluctuations 
was proposed bv lSigad et al.l ([2000!). Here the idea is to mea- 
sure the probability distribution function (PDF) ViS^) of 
the galaxy field using counts-in-cells measurements, and to 
compare that with (log-normal) models for the PDF P((5ni) 
of mass fluctuations. Under the assumption that the bias 
relation between galaxies and matter is deterministic and 



monotonic, one can use these two PDFs to infer the con- 
ditional mean bias relation {5^\5ra)- This method has been 
applied to the first ep och VIMOS VLT deep sur vey (VVDS; 
iLe Fevre et al.l 120051) bv 'Marinoni ct al.' (2005) and to th e 



zCOSMOS survey (|Lillv et al. 2007) bv Kovac et~all l|201lf ). 
Both studies find clear indications that bias is non-linear. 
However, this method has a number of important short- 
comings: it assumes that bias is deterministic, is cosmology- 
dependent, and, because it has to filter the galaxy distribu- 
tion (typically on scales of 5 — 10/i~^ Mpc) is carries no in- 
formation of galaxy bias on small scales. As we have shown, 
these are the scales that carry most information regarding 
the non-linearity and stochasticity of galaxy bias. 

Arguably the most promosing method to measure non- 
linearity and stochasticity in galaxy bias, which does not re- 
quire any assumptions regarding cosmology or bias, is to use 
a combination of galaxy clustering and gravitational lensing. 
The latter is the only direct probe of the matter distribution 
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in the Universe, and allows for direct measurements of the 
galaxy-matter cross correlation (via galaxy-galaxy lensing) 
as well as the matter auto-correlation (via cosmic shear). 
When combined with the galaxy-galaxy correlation, these 
measurements yield the bias functions 6g(r), TZ^raif) and 
rgm(r) discussed in this paper. As elucidated above, the 
scale dependence of these bias functions holds great promise 
to gain valuable insight into the origin of non-linearity and 
stoc hasticity o f galax y bias. 

iPenet al.l (|2003l ) applied t hese ideas to the VIRMOS - 
DESCART cosmic shear survey (I Van Waerbeke et al.ll2000t ). 
Using deprojection techniques, they computed the 3D power 
spectra for dark matter and galaxies, as well as their cross 
correlation. They find a weak indication of scale dependence 
with 7?.|5^ decreasing with scale, in qualitative agree ment 
with the predictions shown in iJS] Ivan Waerbekd (119981 ) and 
[Schneider ( 1998) proposed a somewhat alternative applica- 
tion of this method based on aperture masses and aperture 
number counts. The aperture mass, Map(0), is defined as 



(120021) ■ which was bas e d on the cosmic shear analysis 



M,p(e)= / dVf/(<^)C(<^): 



(64) 



(|Schneider et al.l 119981 ). where 5^ is the projected mass 
overdensity and U{<p) is a compensated filter, i.e. 
Jd(t>cl>U{i) = and U{(t>) = ior (p > 6. Similarly, for 
galaxies one defines the aperture number count 



iv,p(e)= / d2,^c/(,^)5f (0), 



(65) 



with (Sg"^ the projected galaxy overdensity. The mass au- 
tocorrelation function {M^p{9)) can be derived from the 
observed ellipticity correlation functions, the angular two- 
point correlation function of galax;ies yields {N^p{6)), and 
the ensemble-averaged tangential shear as a function of ra- 
dius around a sample of lenses (galaxy- galaxy lensing signal) 
can be used to derive (A^ap( ^)A?"ap(^)) (|Van Waerbeke et al.l 
|2002| : iHoekstra et al.ll2002h . Relating these variances at a 
given smoothing aperture, 6, one obtains the aperture bias 
functions 



6ap(6l) 



oc 



{Nlpm 



and 



7^ap(6l) 



(jVap(g)Afap(g)) 



(66) 



(67) 



where the constants of proportionality depend in principle 
on the distributi on of galaxies and on the assumed cos- 
mological model (jvan Waerbekelll998l ). although minimally 
within the current un certainties on cosmological parameters 
(|Komatsu et al.ll201ll ). 

The apertur e-based m ethod has been applied to data 
from the RCS (lYee fc Glad dcrs 20q2) in iHoekstra etHI 
(|200ll ) and in combi nation with data from the VIRMOS- 
DESCART survey in IHoekstra et~all (J2002[ ). Their resuhs 
indicate that 6ap and 7?.ap are scale-dependent over scales 



0.1 - 5/i"^ Mpc, but that the ratio Ta 



"ap / '^ap 



nearly constant at Fa 



1.1 over this range. On scales 



of ~ 0.5/!."^ Mpc, they find that 6ap = 0.7lto°ot and 
7^ap ~ 0.59to;o? (68% confidence, assuming a flat ACDM 
cosmology with Q„-i — 0.3). In Fig. (9] we show a re- 
vised version of the analysis published in IHoekstra et al.l 



of I Van Waerbeke et al. |2002l) . However, as discussed in 



I Van Waerbeke et al.l (|2005l ). these data suffered from PSF 
anisotropy that was not corrected for. Figure [9] shows 
the new result s obtained using the same cosmology as in 
IHoekstra et al.l (|2002|). the u n chang ed RCS measurements 
and the I Van Waerbeke et al.l (12005) cosmic shea r result^. 
Compared to the results in IHoekstra et al.l (|2002l ). the scale 
dependence on small scales has been somewhat reduced. 
On scales of ~ 0.5h~^ Mpc, the new analysis suggests 
6ap = 0.96to:o? and 7^ap = 0.74+oll (68% confidence). 

The finding that 6ap(S) and TZn^iO) have similar scale 
dependence so that rap(0) is nearly scale independent (over 
the scales probed) seems at odds with the predictions in 
!j5l However, we caution that the bias parameters based on 
aperture variances are different from the ones investigated in 
this paper, which are based on excess surface densities. Fur- 
thermore, lacking any redshift information, IHoekstra et al.l 
(.2002 ) used a (apparent) magnitude selected sample which 
mixes galaxies of different intrinsic luminosity and differ- 
ent physical scales. Hence, one cannot directly compare our 
model predictions (Figs. I4I8|) with the data in Fig. [^ On the 
other hand, the difference between aperture variances and 
excess surface densities is mainly operational, rather than 
conceptual, and it is difficult to imagine that they would 
predict very different scale dependencies . In that re s pect i t 
is interesting that a more recent study bv lJullo et al.l (|2012h . 
using the same aperture variance analysis, but appli ed to 
data from the COSMOS survey (|Scoville et al.ll2007^ ■ ob- 
tain 6ap(S) and 7i.ap{9) that more closely follow the trends 
shown in Figs. I4I8I We intend to interpret these aperture 
variance data within the context of halo occupation statis- 
tics in a future publication. 

Finally, we emphasize that the galaxy-matter cross cor- 
relation is a first-order measure of the cosmic shear and 
therefore much easier to measure than t he matter auto - 
correlation, which is second-order (see e.g.. lSchneidenll998l ). 
Hence, the bias parameter Fgm can typically be measured 
with significantly smaller error bars than either fog or 7?.gm, 
simply because it does not require the matter-matter corre- 
lation function. As we have shown in this paper, the scale de- 
pendene of Fgm, even the one defined via the projected sur- 
face densities, contains a wealth of information regarding the 
non-linearity and stochasticity of the halo occupatio n statis- 
tics, a nd thus regarding galaxy formation. Sheld on et al.l 
(1200J) used galax;y clustering and galaxy-galaxy lensing data 
from the SDSS in order to measure FgJ^(r) for a large 
sample of ~ 100, 000 galaxies. Using Abel integrals, they 
deproject their data in order to estimate the 3D galaxy- 
galaxy and galaxy-matter correlation functions. The result- 
ing Fg!^(r) is found to be roughly scale-invariant over the 
radial range 0.2h~^ Mpc ;S *■ ^ 6.7h~^ Mpc at a value of 
(1.3 ± 0.2) (r2m/0.27). Note that the galaxies used in this 
measurement cover a wide range in magnitudes, making it 
difficult to compare directly to the 'predictions' in Figs. [J- 
(8] It would be interesting to repeat these measurements us- 
ing narrower luminosity bins extracted from the significantly 



^ Note that sample variance for both measurements has been 
ignored, which implies that the error budget is underestimated 
on large scales (above 10 arcmin). 
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Figure 9. The scale dependence of the galaxy bias (top panel) 
and the cross-correlation coefficient (bottom panel) as measured 
with the aperture mass rechnique in the RCS survey (see discus- 
sion in §6.1), properly acc ounting for PSF anisotropy (following 
IVan Waerbeke et alJlSoOsh . 



larger and more accurate SDSS data samples currently avail- 
able. We advocate to perform such an analysis using the 
projected surface densities (Eg. I57 | ). rather than t he de- 
projected 3D quantities used bv lSheldon et al.l (|200J). 



6.2 Suppressing the Scale Dependence of Galaxy 
Bias 

For some applications, it is desirable to have bias functions 
with as little scale-dependence as possible. As discussed in 
H5.ll because of the integration performed when calculat- 
ing the observable, projected bias functions, signal on small 
scales is mixed-in on large scales. This causes the scale above 
which the bias functions are scale-independent to increase. 
A constraint on the amount of scale-dependence therefore 
means that a large fraction of the data would have to be 
discarded. This can be mitigated, however, by defining alter- 
native bias functions that circumvent mixin g- in signal form 
small scales. For instance. iReves et al.l l|20ld ) used the quan- 
tities 



T,y(rp) = AE„,(rp) - M^ AExy(r^i 



(68) 



which are directly related to the excess surface densities de- 
fined in Eq. (|57p . and where rmin is some fiducial length 
scale. By rewriting Eq. (|68|) as 



Txy(rp) 



T.^y{B!)R'dE! -Exy(rp) 



+ 



' mill 



-^xy l^^min j 



(69) 



it is immediately clear that Txy does not include any con- 
tribution from length scales smaller than rmin. Hence, one 
could opt to define the projected bias functions (|58p - (|60| ) us- 
ing Txy(rp) rather than AExy(rp). fn what follows we shall 
indicate these new bias functions as 6g, 7?.gm and Fgm, re- 
spectively; 

iReves et al.l l|2010l ) used galaxy clustering, galaxy- 
galaxy lensing and peculiar velocities of luminous red galax- 
ies (LRGs) from th e SDSS to measure the probe of gravity 
(! Zhang et ai1l2007l ) 



£G(r-p)= ^ 



(^P) 



/3 Tgg(rp) 



= /(t2m)'7?-gm(rp); 



(70) 



where /3 = f{Q,m)/bg is the redshift distortion parameter 
(not to be confused with the Poisson parameter /?), which 
can be measured from the redshift space correlation function 
(e.g., Teg mark ct al. 200 6), /(Sim) ~ ^^^^ is the logarithmic 
linear growth rate, and 6g is the galaxy biaqj. As long as 
7?-gm('"p) = 1, which can be assured by picking a sufficiently 
large rmin, it is clear that Eg is a direct probe of /(Sim), 
which is sensi tive to modificatio ns of the law of gravity. In 
their analysis. iReves et al.l (|2010l ) adopt rmin ~ 1.5/i~^ Mpc. 

We now use our models to investigate the amount of 
scale dependence in 7?.gm(7'p) for various values of Tmin. 
Fig. [To] plots 7?.gm(rp) — 1 for three different values of rmin 
(different rows), and for three magnitude bins (different 
columns). The solid curve corresponds to our fiducial refer- 
ence model, while other curves correspond to several varia- 
tions with respect to this model discussed in ij5](as indicated 
in the bottom panels). The shaded area indicates the region 
where scale dependence of 7?.gm(rp) affects the measurement 
of Eg at less than five percent. 

In the upper panels we set rmin = 0, for which 7?.gm(rp) 
reduces to 7?.gm(rp). As we have already seen, this CCC 
reveals very strong scale-dependence, especially for bright 
galaxies, making 7?.gm(rp) useless for measuring Eg- For 
rmin = l.bh~^ Mpc (panels in middle row), this scale depen- 
dence is drastically reduced, in particular for the brightest 
galaxies. Note, though, that depending on the exact val- 
ues of Qs, /cone and P the CCC 7?.gm(rp) may still differ 
from unity at the 10 to 20 percent level on small scales 
(rp ~ rmin). However, adopting rmin ~ 3/i~^ Mpc (lower 
panels) yields rgm(rp) = 1 to better than 5 percent accu- 
racy for all luminosity bins, and with very little dependence 
on uncertainties regarding the halo occupation statistics. 
Hence , we conclude that the method used by Reyes ct alj 
(|2010l ) succesfully suppresses the scale-dependence of galaxy 
bias. For rm i n = 1. 5fe~^ Mpc, which is the value adopted by 
iReves et all l|2010l ) in their analysis of LRGs in the SDSS, 
our models suggest, though, that there may be some resid- 
ual scale dependence on small scales at the 10-20 percent 
level, depending on detailed aspects of the halo occupa- 
tion statisitics. We find that robustly suppressing scale de- 



^ Not e that our definition of Eg differs from that of lReves et alj 
COlOh by a factor Om,0i which is irrelevant for the discussion 
here. 
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Figure 10. The quantity 7^gm(''p) — 1 for three different values of r^^^ (different rows), and for three magnitude bins (different columns). 
The black solid curve corresponds to our fiducial reference model, while other curves correspond to several variations discussed in Js] (as 
indicated in the bottom panels). The shaded area indicates the region where scale dependence of 7^gni(''p) yields values close to unity 
within 5%. 



pendence in TZg^nir^) to better than five percent requires 
r-min > 3/1"' Mpc. 

Finally, we emphasize that the bias functions 6g(rp), 
7?-gm(»'p) and rgni(rp) should only be used if one is interested 
in suppressing scale-dependence. If, on the other hand, one is 
interested in using two-point statistics to unveil the nature 
of galaxy bias, which is the main goal of this paper, one 
should use the projected bias functions ((58)l -((60 )) instead. 



7 SUMMARY AND CONCLUSIONS 

Studying galaxy bias is important for furthering our under- 
standing of galaxy formation, and for being able to use the 
distribution of galaxies to constrain cosmology. Despite a 
large number of (both observational and theoretical) studies, 
we still lack a useful framework for translating constraints on 
galaxy biasing as inferred from observations into constraints 
on the theory of galaxy formation. 

In an attempt to improve this situation, we have refor- 



mulated the parameterization of non-linear and stochastic 
biasing introduced by Dekel & Lahav (1999; DL99) in the 
framework of halo occupation statistics. The bias parame- 
ters introduced by DL99 relate the smoothed density field, 5g, 
to the smoothed matter field, 5^. This smoothing, however, 
has a number of shortcomings. First, there is considerable 
loss of information on small scales (below the filtering scale), 
which, as we have shown in this paper, carry most informa- 
tion regarding galaxy bias. Second, the smoothing procedure 
is extremely sensitivity to (arbitrary) filtering scale: if the 
filtering scale is too large, the data has too little dynamic 
range to probe the non-linearity in the galaxy bias. A filter- 
ing scale that is too small, on the other hand, results in too 
much shot-noise. Finally, the smoothing technique hampers 
an intuitive link to the theory of galaxy formation. 

Since galaxies are believed to form and reside in dark 
matter haloes, it is far more natural, and intuitive, to define 
halo averaged quantities (i.e., to adopt the host halo as the 
'filtering' scale). This automatically suggest a reformulation 
of the DL99 parameterization in terms of halo occupation 
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statistics. Since dark matter haloes are biased entities them- 
selves, such a formulation has the additional advantage that 
the overall bias of galaxies has a natural 'split' in two com- 
ponents: (i) how haloes are biased with respect to the dark 
matter mass distribution, and (ii) how galaxies are biased 
with respect to the haloes in which they reside. The former 
has been addressed in detail with A^-body simulations (e.g. . 
Mo fc White 1996:'Cat elan et al.|[l99i : |Porciani et al.lll999l : 



Sheth fc Lemson 1999 ). The latter has been the focus of this 



paper. 

We have reformulated DL99 by replacing the filtered 
matter overdensity, 5m, with halo mass, M, and the filtered 
galaxy overdensity, 5g, with the occupation number, A'^. 
Within this modified framework, the sources of non-linearity 
and stochasticty in galaxy bias have logical and intuitive 
connections to various aspect of galaxy formation. In par- 
ticular, non-linearity refers to deviations from {N\A'I) oc M, 
while stochasticity refers to scatter in the probability distri- 
bution function P{N\M). Based on our basic understanding 
of galaxy formation, it is essentially impossible to have linear 
galaxy biasing. First of all, since galaxies of a given lumi- 
nosity (or stellar mass) are only expected to form in haloes 
above some minimum mass, one always expects that {N\M) 
has some cut-off at low M. Furthermore, there is no con- 
vincing reason why {N\M} should scale linearly with halo 
mass above this mass scale. In particular, since {N\M} — 
{Nc\M} + {Nc\M), and < {Nc\M) < 1, the contribution 
due to centrals typically gives rise to strong non-linearity. As 
for stochasticity, centrals contribute from the fact that there 
is non-zero scatter in the relation bet ween halo mass an d 
the luminosity of central galaxies (e.g., iMore et al.ll2009a ). 
The contribution from satellites comes from scatter in the 
halo occupation distribution P{Ns\M). Finally, an addi- 
tional source of stochasticity may come about if A'^c and A's 
are not independent random variables. 

A powerful method to probe the non-linearity and 
stochasticity of galaxy bias is via two-point correlation func- 
tions. In particular, non-linearity and stochasticty mani- 
fest themselves as scale dependence in the bias parameter 
b'^{r) = £,gg{r) / ^uim(r) , and the galaxy-matter cross cor- 



relation parameter TZ^{r) 



?gm(r-)/\/Cgg(r)Cmm(r). lu 

this paper we have proposed a modified set of bias pa- 
rameters, bg{rp) and 7?.gm('"p), that are related to excess 
surface densities, rather than real-space correlation func- 
tions. These have the advantage that they can be inferred 
from data in a straightforward manner, without the need 
for complicated, and noise-enhancing, deprojection meth- 
ods. In particular, combining galaxy clustering and galaxy- 
galaxy lensing, it is straightforward to measure the ratio 
rgm(rp) = 6g(rp)/7^gm(T■p). 

Using the halo model and a realistic halo occupation 
model, based on th e conditional luminosity function of 
ICacciato et al.l (|2009l ). we have investigated how deviations 
from the linear and deterministic biasing scheme manifest 
themselves in the scale dependence of 6g, 7?.gm, and Fgrn- 
In particular, we have compared predictions covering the 
spatial scale 0.1 < Tp < 30 Mpc and for galaxies in three 
r-band magnitude bins; [-19,-18], [-21,-20], and [-22.5,-22]. 
These choices aim at bracketing the range of interest for 
current and forthcoming galaxy surveys. 

We have shown that galaxy biasing is scale indepen- 
dent, with 7?.gm = 1, on large scales down to about r ~ 



2 - 5/i"^ Mpc. The exact radius at which 7^™ (and 6™) 
become scale dependent depends on luminosity, with fainter 
galaxies remaining scale independent down to smaller scales. 
This result is robust to all the model variations we have 
performed, but it is only valid in real-space. When using 
the projected bias parameters advocated here, which are 
more easily accessible observationally, the associated quan- 
tity 7?.gm remains scale dependent out to much larger radii 
(as far out as ~ 20h~^ Mpc). This comes about because the 
excess surface densities at projected radius Vp contain in- 
formation from all scales r < rp. This 'scale-mixing' can 
be avoided by using the relative excess s urface densities 
(see iBaldauf et all I2OI0I : I Reyes et al.ll2010l ). which do not 
include any contribution from length scales smaller than 
some fiducial radius rmin- Indeed, our models indicate that 
bias parameters based on these relative excess surface den- 
sities are scale independent at better than 5 percent for 
Tmin > 3h~^ Mpc. For rmin = 1.5/i~^ Mpc, which is the 
value used by iReves et al.l (|20ld ) in their study to test the 
validity of GR, we find residual scale dependence on small 
scales rp ~ rmin of the order of 20 percenllj. 

On small scales (rp ;$, 2 — 5h~^ Mpc), the bias functions 
6g(rp) and 7?.gm(rp) reveal strong scale dependence. The de- 
tailed behavior of &g(rp) and 7?.gm(rp) depends on (i) the 
luminosity of the galaxies in question, with more luminous 
galaxies typically revealing stronger scale-dependence, (ii) 
the detailed behavior of the occupation statistics, {N\M), 
(iii) the scatter in the relation between halo mass and the 
luminosity of central galaxies, ac, (iv) whether the satellite 
occupation distribution P(A's|M) is Poissonian or not, and 
(v) the radial number density distribution of galaxies within 
their host halo. For bright galaxies {°-^Mr - 5 log /i < - 20) , 
the dominant effect giving rise to the scale-dependence of 
7?-gm(rp) is the precence of central galaxies, which occupy 
very biased regions of their host haloes. For fainter galax- 
ies, 7?.gm(rp) is expected to be close to unity, down to 
rp ~ 0.2h~^ Mpc, but with some dependence on the Poisson 
parameter /?. Finally, we stress that 6g(rp) and 7?.gm(rp) for 
the brightest galaxies are extremely sensitive to the amount 
of scatter, ctc, in the relation between halo mass and the 
luminosity of central galaxies. 

We conclude that, since different aspects of halo occupa- 
tion statistics impact the bias functions 6g(rp) and 7?.gm(rp) 
in different ways, there is great promise to unveil the nature 
of galaxy bias from measurements of 6g(rp) and 7?.gm(rp), (or 
from related quantities, such as the aper ture- based bias pa- 
rameters introduced bv lSchneiderl (|l998l ) and Ivan Waerbekd 
( 1998)). Motivated by existing and forthcoming imaging and 
spectroscopic galaxy surveys, we advocate using a combina- 
tion of clustering and galaxy-galaxy lensing to determine the 
ratio of Fgm ~ b^/TZgm, as a function of the spatial scale rp 
for a number of different luminosity (and/o r stellar mass) 
bins. Inspired by the preliminary work of ISheldon et al.l 
I2OO4I . we intend to perform such an analysis in the near 
future, using data from the SDSS. 



^ Since JRevea et alj KOlOh did not use information on these small 
scales, their results are not influenced by this residual scale de- 
pendence. 
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